Classification of Stressed Speech Using Physical Parameters Derived from Two-Mass Model
نویسندگان
چکیده
In this study, we investigate physical parameters which can be used to classify speech as either stressed or neutral based on a two-mass vocal fold model. The model attempts to characterize the behavior of the vocal folds and fluid airflow properties when stress is present. The two-mass model is fitted to real speech to estimate the values of physical parameters that represent the stiffness of vocal folds, vocal fold viscosity loss, and subglottal pressure coming from the lungs. The estimated parameters can be used to distinguish stressed speech from neutral speech because these parameters can represent the mechanisms of vocal folds under stress. We propose combinations of physical parameters as features for classification. Experimental results show that our proposed features achieved better classification performance than features derived from traditional methods.
منابع مشابه
Classification of speech under stress by modeling the aerodynamics of the laryngeal ventricle
We focus on variations in the aerodynamics of airflow patterns in the laryngeal ventricle and the false vocal folds based on a physical model for the classification of neutral and stressed speech. We modify the two-mass model to include the laryngeal ventricle, and the physical parameters characterizing airflow variations in the laryngeal ventricle under psychological stress are explored. The t...
متن کاملClassification of speech under stress based on modeling of the vocal folds and vocal tract
In this study, we focus on the classification of neutral and stressed speech based on a physical model. In order to represent the characteristics of the vocal folds and vocal tract during the process of speech production and to explore the physical parameters involved, we propose a method using the two-mass model. As feature parameters, we focus on stiffness parameters of the vocal folds, vocal...
متن کاملA Comparative Study of Gender and Age Classification in Speech Signals
Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...
متن کاملImproved Bayesian Training for Context-Dependent Modeling in Continuous Persian Speech Recognition
Context-dependent modeling is a widely used technique for better phone modeling in continuous speech recognition. While different types of context-dependent models have been used, triphones have been known as the most effective ones. In this paper, a Maximum a Posteriori (MAP) estimation approach has been used to estimate the parameters of the untied triphone model set used in data-driven clust...
متن کاملEfficiency of seismic wave velocity and electrical resistivity in estimation of limestone rock mass quality indices (Q, Qsrm) (Case study: Asmari formation, SW Iran)
In this research, the relationship between P-wave velocity (Vp) and Electrical Resistivity (ER) parameters with rock mass quality indices is investigated; parameters such as rock mass quality classification (Q) and modified system for sedimentary rocks, known as Qsrm. For making predictive models, about 1200 data-sets extracted from sections drilled in Seymareh and Karun 2 Dam Sites (SDS and KD...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012